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Abstract 

Generalized Yang-Mills theories are constructed, that can use fields 
other than vector as gauge fields. Their geometric interpretation is 
studied. An application to the Glashow-Weinberg-Salam model is 
briefly review, and some related mathematical and physical consider- 
ations are made. 

1 Introduction 

Yang-Mills theories have been tremendously useful in high energy physics, 
where they have served to successfully model the electroweak and strong 
interactions. The fundamental idea of a Yang-Mills theory is that its math- 
ematical expression must be invariant under a local compact Lie group of 
transformations. By local here it is meant that the group element varies 
with the point in Minkowski space that is being considered. In a typical 
physical theory the quantum fields often appear differentiated with respect 
to space or time. This means that, when the field transforms under the local 
Lie group, the differentiation operator is going to act on the transforming 
group element as well as on the field itself, so there is no invariance of this 
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term under the Lie group. Invariance is reestablished substituting the dif- 
ferentiation operators by covariant derivatives. A covariant derivative is the 
sum of the differential operator and the Yang-Mills field. These fields are 
required to transform by adding terms that precisely cancel the extra terms 
brought in by the differentiation operators. Such a transformation is called 
a gauge transformation and the theory is left unmodified by it. Fields that 
perform this kind of service are generically called gauge fields. 

Since the differentiation of a scalar field d^tp, = d/dx^, \i — 0, 1, 2, 3, 
results in a four-vector, the gauge fields have also been taken to be four- 
vectors. In this article we are going to review [|T[ how it is possible to generalize 
the concept of a covariant derivative in a Yang-Mills theory, so that fields 
other than vector can be used as gauge fields. [f2| The idea of putting the 
leptons in a triplet and using the graded group SU (2/1) goes back to Ne'eman 
and Fairlie.[[J The Lie group SU (3) almost has the right quantum numbers to 
embed the Glashow-Weinberg-Salam (GWS) Model in it, but not quite. The 
right group seems to be U(3) in a special representation. An extra boson 
appears, but it automatically uncouples from the rest of the particles. [[Lj 
However, the emphasis of this paper is on the mathematical aspect, rather 
than on high energy applications. 



2 The kinetic energy of a vector field trans- 
forming spinorially 

The quantum electrodynamics Lagrangian is 

C QED = 4)ilpil> - -F^F^ (1) 

where = <9 M + ieA^ and F^ = d^A u - d u A^ = -ie'^D^, D u ], and 
where we are using a metric r]^ u = diag(l, —1, —1, —1). This Lagrangian is 
invariant under a local transformation group based on the U(l) Lie group. If 
U = e~ ta ^ is an element of this group, then the electrically charged fermion 
field transforms as ip — > Uip. The vector field is required to obey the gauge 
transformation law 

A^ + e-\d^a), (2) 
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so that, recalling the definition of D^, the covariant derivative must transform 
as 

D, - UD^U- 1 . (3) 

In the previous equation the derivative that is part of the covariant deriva- 
tive is acting indefinitely to the right. We call such operators unrestrained, 
while the ones that act only on the immediately following object, such as the 
partial in @, we call restrained, and use a parenthesis to emphasize that the 
action of the differentiation operator does not extend any further. We do not 
find it admissible to have unrestrained operators in a Lagrangian, because, 
first, they are not gauge invariant, and, second, what they could physically 
or mathematically mean is not clear. While the operators _D M and D^D V are 
unrestrained, the operator [Z) M , D v \ is restrained, and it is entirely appropri- 
ate that the kinetic energy of a vector boson can be constructed using this 
commutator. The way the commutator becomes restrained is as follows: 

[D„ D u }f = d„A v f - A u d,f - d v A„f + A,d u f = (d,A u )f - (£U M )/, (4) 

where / = f(x) is some differentiable function and it is seen how four unre- 
strained operators result in two restrained ones, thanks to Leibnitz' rule. 

Let S be an element of the spinor representation of the Lorentz group, 
so that, if -0 is a spinor, then it transforms as ip — > Sip. Then, due to the 
homomorphism that exists between the vector and spinor representations of 
the Lorentz group, we have that JJl — > Sj^S^ 1 . It was this homomorphism 
that allowed Dirac to write a spinorial equation that included the vector 
electromagnetic field. The first step in our road to a generalization of the 
covariant derivative will be to rewrite the Lagrangian ([!]) using the spinorial 
representation. To this effect we have the following 

Theorem. Let = 8^ + B^, where is some vector field. Then: 

{d»B v - d u B (t ){d^B v - d»B") = ~ Tr p 2 - ~ Tr p\ (5) 

where the traces are to be taken over the Dirac matrices. 

Proof. Notice the partials on the left of this identity are restrained, the 
ones on the right are not. To prove the theorem it is convenient to use the 
following trick, which makes the algebra manageable, in this and in more 
complicated cases to follow. Consider the differentiable operator O = d 2 + 
2B ■ d + B 2 . Notice that it does not contain any contractions with Dirac 
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matrices, so that Tr O = 40, TrO($$ = 40 (d • B), etc. It is not difficult 
to see then that p = O + where the slashed partial is acting only on 

the succeeding slashed field. The trick is to use this form of p 2 in the traces 
on the right of (|5]). With it one obtains 

\i[o+m\-\^[o + m)? = 2 (o ■ Bf - \ ^ \m m\ 

= (d IM B u -d u B^(d»B»-d»B»)(6) 

as we wished to demonstrate. The motivation for the additional trace term 
is the same as for taking the commutator of the covariant derivatives: to 
ensure that the differential operators be restrained. □ 

With the aid of the Theorem we can rewrite the QED Lagrangian in the 
form 

C QED = $ipl, + e~ 2 (± Tip 2 - 1 - Tr p^j , (7) 

whose Lorentz invariance can be easily proven using p — > SpS^ 1 and the 
cyclic properties of the trace. As an example of the invariance, observe that 
Trp 2 -> TrSpS^SpS- 1 = Tip 2 . 



3 A scalar field functioning as a gauge field 

We are going to construct an example of a theory that employs a scalar 
instead of a vector boson to maintain gauge invariance. To keep things simple 
we use i7(l) as our Lie group, as in the previous section, and so, again, (|3|) 
must hold. This time, however, we take the covariant derivative to be in the 
spinorial representation, as in (0), so it becomes possible to define it to be: 

Z^ = ^-e 7 V (8) 

We now require the gauge field to transform as ^ 5 ip — > 7 5 y? — ie~ x (j)a. These 
equations immediately assure us that D v — > UD^U^ 1 , and so Lagrangian 

C v = ijtD^ + e~ 2 f 1 Tr D 2 - I Tr D^j (9) 

is gauge invariant. The trace terms in this Lagrangian can be simplified 
algebraically and the Lagrangian written in the more traditional form 

C v = i>i^-ei;ij 5 ^ + -(d ti <f)(d^) (10) 
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after a bit of algebra. This last calculation is similar to the one done last 
section, but with the 7 5 taking the place of the 7^'s of that previous calcu- 
lation. 

4 Non-abelian Yang-Mills theory with mixed 
gauge fields 

Consider a Lagrangian that transforms under a non-abelian local Lie group 
that has N generators. The fermion or matter sector of the non-abelian 
Lagrangian has the form ipipip, where is a covariant derivative chosen to 
maintain gauge invariance. This term is invariant under the transformation 
i/j — > Uip, where U — U(x) is an element of the fundamental representation of 
the group. The covariant derivative is = d^ + A^, where A^ = igA^(x)T a 
is an element of the Lie algebra and g is a coupling constant. We are assuming 
here that the set of matrices {T a } is a representation of the groups generators. 
Gauge invariance of the matter term is assured if 

A, - UA^U' 1 - (d^U)U- 1 , (11) 

or, what is the same, 

4 -> UAV' 1 - (QU)U- 1 (12) 

We have already seen how scalar fields can function as gauge fields. Our aim 
in this section is to construct a non-abelian theory that uses both scalar and 
vector gauge fields. We proceed as follows. For every generator in the Lie 
group we choose one gauge field, it does not matter whether vector or scalar. 
As an example, suppose there are N generators in the Lie group; we choose 
the first N v to be associated with an equal number of vector gauge fields 
and the last N$ to be associated with an equal number of scalar gauge fields. 
Naturally Ny + Ng = N. Now we construct a covariant derivative D by 
taking each one of the generators and multiplying it by one of its associated 
gauge fields and summing them together. The result is 

D = ^ + 4 + ^ (13) 

where 

4 = 7 M M = ig-fAp 1 *, a = l,...,N v , 
$ = ^ v = - gi 5 <p b T b , b = N v + 1, . . . , N. 
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Notice the difference between and A a , and between <p and (p b . We 
take the gauge transformation for these fields to be 

4 + $ -> + - (QU)U- 1 , (14) 

from which one can conclude that D — > UDU^ 1 . The following Lagrangian 
is constructed based on the requirements that it should contain only matter 
fields and covariant derivatives, and that it possess both Lorentz and gauge 
invariance: 

C NA = ^Ehj) + ^Tr ^ Tr D 2 - l - Tr D ^ (15) 

where the trace with the tilde is over the Lie group matrices and the one 
without it is over matrices of the spinorial representation of the Lorentz 
group. The additional factor of 1/2 that the traces of ([15]) have with respect 
to @ comes from normalization TrT a T b = |5 a (,, the usual one in non-abelian 
gauge theories. 

Although we have constructed this non-abelian Lagrangian based only on 
the requirements just mentioned, its expansion into component fields results 
in expressions that are traditional in Yang-Mills theories. The reader who 
wishes to make the expansion herself can substitute flHf ) in ATS]), keeping 



in mind the derivatives are unrestrained, and aim first for the intermediate 
result 

J_TrD 2 -iTrD 4 = ((<9 • A) + A 2 ^ — Tr (($4) + 44) 2 

-^Tr(0) + {4,$}) 2 , (16) 

where the curly brackets denote an anticommutator. (We recommend to use 
here the trick explained in section 2.) Notice in this expression that the differ- 
entiation operators are restrained, and that the two different types of gauge 
fields appear only in an anticommutator. The 7 5 in the scalar boson term of 
the generalized covariant derivative D ensures both that the partials become 
restrained and that these anticommutators become commutators once the 
properties of the Clifford matrices are taken into account. Substituting (pf) 
in flTBD and in the matter term of ( |T5D we obtain the non-abelian Lagrangian 
in expanded form: 

C NA = ^ + 4) t p-g^^T b ^ + ^fr(d fl A u -d u A^ + {A fl ,A u ]) 2 

^9 
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+\Tr{d ll <p+[A lt ,<p\f- (17) 

The reader will recognize familiar structures: the first term on the right looks 
like the usual matter term of a gauge theory, the second like a Yukawa term, 
the third like the kinetic energy of vector bosons in a Yang-Mills theory and 
the fourth like the gauge-invariant kinetic energy of scalar bosons in the non- 
abelian adjoint representation. It is also interesting to observe that, if in 
the last term we set the vector bosons equal to zero, then this term simply 
becomes YJ b M \d IJ <p h d ,i <p b , the kinetic energy of the scalar bosons. We have 
constructed a generic non-abelian gauge theory with gauge fields that can be 
either scalar or vector. 

5 Applying these ideas to the GWS Model 

These ideas were applied |J to the GWS Model of high energy physics. The 
Higgs fields of the GWS Model were used, along with the usual vector bosons, 
to construct a generalized covariant derivative. The original intention was to 
use SU{3) as the gauge Lie group, because it generates quantum numbers for 
the particles that are very close to the experimental ones. Eventually it was 
noticed that the complete leptonic and bosonic sectors of the GWS Model 
could be written in the form of Lagrangian flI5|) using the group U(3). 

The covariant derivative D contains the gauge vector bosons of U(l) x 
SU(2), the scalar Higgs bosons, and a new scalar boson. The triplet ip = 
{yLi e L, 6r) contains the leptons and is transformed by the fundamental rep- 
resentation of U(3). All quantum numbers are correctly predicted, and an 
extra scalar boson, but it automatically decouples from the rest of the model 
and is thus unobservable except through its gravitational effects. The repre- 
sentation of the group generators is not the usual one, but instead, a special 
one where the generators are obtained as a linear combination of the usual 
ones. 

The quarks have not been included so far into the scheme. There is one 
term of the pertinent sector of the GWS Model that is not predicted by this 
generalized derivative model, and it is the potential of the Higgs field V (<p) 
that could cause the spontaneous symmetry breaking. 
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6 A generalized curvature 



In a Yang-Mills theory, be it abelian or not, the terms with physical content 
must be gauge invariant and not contain unrestrained derivatives. For exam- 
ple, if is the covariant derivative of a non-abelian theory, the usual expres- 
sion for the kinetic energy of the gauge vector bosons is Trf-D^, D^D 11 , D u ], 
which satisfies both conditions. But even the expression = [D^, D v \ by 
itself does not have any unrestrained derivatives, while Tr has the addi- 
tional property of being gauge invariant. This quantity is the curvature in a 
principal vector bundle. The question arises if similar results as those for F^ 
hold also for a theory with a generalized covariant derivative. The answer 
to this question is in the affirmative. We proceed now to define a quantity 
which we shall call the generalized curvature F. Let D be the generalized 
covariant derivative, as given in ([13]); then: 

F = l^TrD 2 -D 2 , (18) 

or else, in terms of the derivative's constituents, 

F = Id ■ A — ($4) + 1A ■ A - 44 - - {4, $}, (19) 

where 1 is a 4 x 4 unit matrix, so that Tr 1 =4. It can be seen that there are 
no unrestrained derivatives, and clearly Tr F is gauge invariant. 

In the case of a Yang-Mills theory the vector boson kinetic energy can 
be written exclusively in terms of the curvature F^, the commutator of the 
covariant derivative with itself. In the generalized case we study here, the 
kinetic energy can also be written in terms of the quantity F defined above, 
a quantity, that, as previous examples of the curvature, is quadratic in D 
and restrained. The relation between these two quantities is 

- Tr Z) 2 — - Tr 4 = — i Tr F 2 , (20) 
8 2 2 y 1 

that uses only traces for the Dirac matrices. 
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7 The curvature in terms of the covariant 



derivative 

Let us review the curvature concept using Riemannian geometry as example. 
It is well-known that geodesic deviation and parallel transport around an 
infinitesimally small closed curve in a Riemann manifold are two aspects of 
the same construction, and that trivial results do not occur in each case only 
for curved manifolds, [ffl Consider thus a four dimensional Riemann manifold 
with metric and a vector B 11 . The parallel transport of a vector B 13 around 
a closed curve C is given by 

abp = f G re„B'^d8, (21) 

where s is a parametrization of the curve and T^ ucr is the connection in a 
Riemann space, the Christoffel symbol. Let now C be a small parallelo- 
gram made up of two short vectors x^ and y^, so that its area tensor is 
AS^ = 7}X^y u \ The integral can be performed taking the vector field to 
be constant along the sides of the parallelogram, and expressing the values 
of the Christoffel symbol and the vector field at the curve as the first two 
terms of an expansion about, say, the center of the parallelogram. Thus, if 
the values of those quantities at a point in one of the sides of the parallelo- 
gram are T a ps and B 13 , then, for one of the sides, T a ps — F a ps + Y a ps >a \x a 
and B@ + (B 13 a + T f3 (TT B r )^x CT , where the quantities without tilde take their 
values at the center of the parallelogram. The contribution of this side is: 

y s r* ps B% = y 5 (T% d + T a ^ a x a /2)[B^ + (E% + T^ T B^/2] (22) 

The contribution of this and its opposite side is: 

y s T a p S B% +3 = y 5 dx ff \T a fls Bf , t<T + T a p 6 T^ T B T + T a ^ a B% (23) 

Summing over the four sides one obtains 

AB a = -R a MS B p x~<y 5 = -R a p^B 13 AS"< S , (24) 

where R a p~,s = F a p 7> $ — T a ps,-y + Y a s T Y T p^ — T a 1T T T p$ is the Riemann tensor. 
The positive contributions to the tensor come from two opposite sides of 
the parallelogram, and the negative from the other two sides, resulting in 
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structure of commutators that can be seen to arise from a commutator of the 
covariant derivative with itself. 

In the case of a Yang-Mills theory, the vector boson kinetic energy can 
be written exclusively in terms of the curvature F^, the commutator of the 
covariant derivative with itself. The parallel transport of a Yang-Mills field, 
in its role as a connection, around a closed path|| results in the curvature, 
the Yang-Mills connection squared. For two short vectors x M and y M that 
form a parallelogram parallel transport results, as can be shown similarly to 
the way it was done in last paragraph for the Christoffel symbol, in 

|V/ = W, (25) 

which gives a motivation for the interpretation of F^ u as a curvature. 

In the generalized case we have been studying in this paper the kinetic 
energy can also be written in terms of a curvature, precisely the quantity F 
already defined, which was quadratic in D and restrained. The equality does 
not need the taking of the Lie group trace, and one simply has 

-Ti D 2 — -Tr D 4 = — - Tr F 2 . (26) 
8 2 2 

This equality can be shown to be true substituting F as given by fll8D on the 
right in the equation above. Thus the kinetic energy goes as the square of 
the curvature in the generalized case, too. 



8 Studying the geometric background 

Let us study the possible geometric interpretations of the formalism. Let 
then x M and y v be two small vectors that form a parallelogram, as before. To 
calculate the parallel transport, the dot product between vectors has to be 
done taking a trace over the Dirac matrices, but otherwise the procedure is 
the same as before. Using the generalized covariant derivative and the con- 
tracted forms ^ and ^ for the two vectors, and taking the small parallelogram 
as a path to perform the parallel transport integral, one obtains 

\ r YxDDti = F tlv x»y\ (27) 
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where F^ v is constructed as usual with the vector fields that make up the 
covariant derivative. While this result is reasonable, it is unsatisfactory in 
that the scalar fields are left as useless bystanders. 

One possible attempt to use the formalism we have developed to its fullest, 
is adding other terms to ^ and ^. Let then x 5 and y 5 be two numbers and 
define 

d x =$ + a; 5 7 5 and d y =j + y 5 J 5 - (28) 

It is possible now to generalize the parallel transport integral, and substitute 
(j28|) in the trace expression of fl26|) . The result is: 

^Tr DD$i = (d^A v] + [A^A v ))x»y» +{d^+[A^ V \){y^ -x^ 

+d 5 A tl (xY-^y 5 )- (29) 

The geometric interpretation of this equation is straightforward: we are deal- 
ing with a five dimensional manifold. The metric is Trd x d y = 4(x • y + x 5 y 5 ). 

The previous result is interesting from a mathematical point of view, but 
it does not seem to be very enlightening when it comes to an understanding 
of the geometric meaning of the formalism as applied to the GWS Model. 
The reason is that, once we have promoted the linearly independent term 
x 5 7 5 in (|28|) to represent a new dimension, the covariant derivative (|1^) has 



to include an extra term, a derivative with respect to the new dimension, 
and become D = $ + <9s7 5 + 4 + $. This could, in principle, add several 
new terms to the generalized derivative, and we would not have the GWS 
Model anymore. As it is, all the new terms vanish except one; but one term 
alone is bad enough, and we conclude that this five dimensional model cannot 
represent the GWS Model. For the record we present curvature due to the 
generalized derivative with the extra dimension: 

F = d-A-($4)+A-A-44-($<S>)-{4,$}-($ 5 4); (30) 

the sole remaining new term is the last one. 



9 Final remarks 

We have reviewed the construction of a generalized Yang-Mills theory that 
uses fields other than vector as gauge fields. We have shown a possible ge- 
ometric interpretation that requires an additional length parameter, but it 
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requires, to be consistent, an additional term in the covariant derivative with 
respect to the new dimension. The results are interesting from a mathemat- 
ical point of view, but not very helpful if one is trying to understand the 
geometry of the formalism as applied to the GWS Model, since this dimen- 
sion is not observed. 

In this model some of the generators of the Lie group are vectorial, while 
other are scalar. From a mathematical point of view it make no difference 
which ones are which, but if one is aiming to reproduce the GWS Model, 
one is has no free choice in this matter. This is interesting because it relates 
the symmetries of the base manifold to the internal symmetry space of the 
particles, the Lie group. One would expect that eventually an association of 
this kind should follow from first principles. 
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